Experiments in adaptation of language models for commercial applications

نویسندگان

  • Petra Witschel
  • Harald Höge
چکیده

To improve recognition accuracy for large vocabulary speech recognition systems we use language models based on linguistic classes (extended POS). In this paper an adaptation technique is presented, which profits from linguistic knowledge about unknown words of new domain. Switching from basis domain to new domain we keep the bigram probabilities of linguistic classes fixed and adapt only monograms of word probabilities. In our experiments we use three different corpora: financial columns of a newspaper corpus and two medical corpora (computer tomography and magnetic resonance). Adapted language models show an improvement of test-set perplexity of 48% to 51% compared to the case of putting unknown words into the language model " unknown " class.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of adaptation and grain yield stability of durum wheat (Triticum turgidum L.) genotypes in temperate agro-climate zone of Iran

Identification of adapted genotypes with high grain yield is the most important goal in durum wheat breeding programs. To study adaptation and grain yield stability of durum wheat genotypes, 18 durum wheat promising lines with two commercial durum and bread wheat cultivars were used. The durum wheat genotypes were evaluated in four locations; Isfahan, Karaj, Kermanshah and Neishabour in tempera...

متن کامل

Dictionary of Abstract and Concrete Words of the Russian Language: A Methodology for Creation and Application

The paper describes the first stage of a project on creating an electronic dictionary with numerical estimates of the degree of abstractness and concreteness of Russian words. Our approach is to integrate data obtained from several different sources: text corpora, psycholinguistic experiments, published dictionaries, markers of abstractness (certain suffixes) and a translation of a similar dict...

متن کامل

EFL Classroom Discourse in Iranian Context: Investigating Teacher Talk Adaptation to Students’ Proficiency Level

How language teachers talk is a key factor in organizing and facilitating learning specifically in language classrooms where the medium of instruction is also the subject matter. This study aimed to examine the extent and ways of teacher talk adaptation to students’ proficiency levels in the Iranian EFL context. Two EFL teachers who were teaching three different proficiency levels were observed...

متن کامل

English Language Teaching Material Development

The goal of language programs is to utilize language for effective communication. Due to the needs, interests, and motivations of language learners, they may show individual differences in their lan- guage learning. Materials used in language programs can be instructional, experiential, elucidative, or exploratory in that they can inform learners about the language, provide experience of the la...

متن کامل

ارزیابی ارگونومیک طرح جدید پوست کن دستی بر اساس روش الکترومیوگرافی

  Background and aims : In our daily lives hand tools are often used in many work situations. With respect to design characteristics, hand tools can be considered a risk factor when a high level of repetition are required or when awkward postures are adopted. Therefore attention to ergonomics design rules for hand tools is the most important factor in design process. With this in mind, the aim ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997